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Abstract 

Multireader shared registers are basic objects used as communication medium in asynchronous 
concurrent computation. We propose a surprisingly simple and natural scheme to obtain several 
wait-free constructions of bounded 1-writer multireader registers from atomic 1-writer 1-reader 
registers, that is easier to prove correct than any previous construction. Our main construction 
is the first symmetric pure timestamp one that is optimal with respect to the worst-case local 
use of control bits; the other one is optimal with respect to global use of control bits; both are 
optimal in time. 

1 Introduction 

Interprocess communication in distributed systems happens by either message-passing or shared 
variables (also known as shared registers). Lamport [201 argues that every message-passing system 
must hide an underlying shared variable. This makes the latter an indispensable ingredient. The 
multireader wait-free shared variable is (even more so than the multiwriter construction) the building 
block of virtually all bounded wait-free shared- variable concurrent object constructions, for example, 
EmnilSllHlEllZlIIllHinilini- Hence, understanding, simplicity, and optimality of bounded 
wait-free atomic multireader constructions is a basic concern. Our constructions are really simple 
and structured extensions of the basic unbounded timestamp algorithm in |^, based on a natural 
recycling scheme of obsolete timestamps. 

Asynchronous communication: Consider a system of asynchronous processes that communi- 
cate among themselves by executing read and write operations on a set of shared variables only. The 
system has no global clock or other synchronization primitives. Every shared variable is associated 
with a process (called owner) which writes it and the other processes may read it. An execution of 
a write (read) operation on a shared variable will be referred to as a Write (Read) on that variable. 
A Write on a shared variable puts a value from a pre determined finite domain into the variable, and 
a Read reports a value from the domain. A process that writes (reads) a variable is called a writer 
(reader) of the variable. 

Wait-free shared variable: We want to construct shared variables in which the following two 
properties hold. (1) Operation executions are not necessarily atomic, that is, they are not indivisible, 
and (2) every operation finishes its execution within a bounded number of its own steps, irrespective 
of the presence of other operation executions and their relative speeds. That is, operation executions 
are wait- free. These two properties give rise to a classification of shared variables: (i) A safe variable 
is one in which a Read not overlapping any Write returns the most recently written value. A Read 
that overlaps a Write may return any value from the domain of the variable; (ii) A regular variable is 
a safe variable in which a Read that overlaps one or more Writes returns either the value of the most 
recent Write preceding the Read or of one of the overlapping Writes; and (iii) An atomic variable is 
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a regular variable in which the Reads and Writes behave as if they occur in some total order which 
is an extension of the precedence relation. 

Multireader shared variable: A multireader shared variable is one that can be written by one 
process and read (concurrently) by many processes. Lamport [201 constructed an atomic wait-free 
shared variable that could be written by one process and read by one other process, but he did not 
consider constructions of wait-free shared variables with more than one writer or reader. 

Previous Work: In 1987 there appeared at least five purported solutions for the wait-free 
implementation of 1-writer n-reader atomic shared variable from atomic 1-writer 1-reader shared 
variables: 19,.2S10]|^ and the conference version of of which was shown to be incorrect in 
[5] and only |27| appeared in journal version. The only other 1-writer n-reader atomic shared variable 
constructions appearing in journal version are |lUj and the "projection" of the main construction in 
A selection of related work is 29, 2l 13 El HOI HH US El [13 EHI EH E2 E3 BS , 26, 27, 28 . 
A brief history of atomic wait-free shared variable constructions and timestamp systems is given in 

[El- 

Israeli and Li jl5| introduced and analyzed the notion of timestamp system as an abstraction of 
a higher typed communication medium than shared variables. (Other constructions are |3 1161 13 
151 [71 IT^.) As an example of its application, | 15| presented a non-optimal multireader construction, 
partially based on |21|, using order control bits overall, and order n control bits locally for each 
of 0{n^) shared 1-reader 1-writer variables. Our constructions below are inspired by this timestamp 
method and exploit the limited nature of the multireader problem to obtain both simplification and 
optimality. 

This work: We give a surprisingly simple and elegant wait-free construction of an atomic multi- 
reader shared variable from atomic 1-reader 1-writer shared variables. Other constructions (and our 
second construction) don't use pure timestamps jl5j but divide the timestamp information between 
the writer's control bits and the reader's control bits. Those algorithms have an asymmetric burden 
on the writer-owned subvariables. Our construction shows that using pure timestamps we can bal- 
ance the size of the subvariables evenly among all owners, obtaining a symmetric construction. The 
intuition behind our approach is given in Section [3 Technically: 

(i) Our construction is a bounded version of the unbounded construction in j29| restricted to 
the multireader case (but different from the "projection" of the bounded version in |12j)i and its 
correctness proof follows simply and directly from the correctness of the unbounded version and a 
proof of proper recycling of obsolete timestamps. 

(ii) The main version of our construction is the first construction that uses a sublinear number 
of control bits, O(logn), locally for everyone of the rt^ constituent 1-reader 1-writer subvariables. It 
is easy to see that this is optimal locally, leading to a slightly non-optimal global number of control 
bits of order n^logri. Implementations of wait-free shared variables assume atomic shared (control) 
subvariables. Technically it may be much simpler to manufacture in hardware — or implement in 
software — small 0(logn)-bit atomic subvariables that are robust and reliable, than the exponentially 
larger 0(n)-bit atomic subvariables required in all previous constructions. A wait-free implementation 
of a shared variable is fault-tolerant against crash failures of the subvariables, but not against more 
malicious, say Byzantine, errors. Slight suboptimality is a small price to pay for ultimate reliability. 

(iii) Another version of our construction uses n 1-writer 1-reader variables of n control bits and 
1-writer 1-reader variables of 2 control bits each, yielding a global 0{n^) number of control bits. 

There need to be v? 1-reader 1-writer subvariables for pairwise communication, each of them using 
some control bits. Hence the global number of control bits in any construction is VL{n^). We also 
reduce the number of copies of the value written to 0{n) rather than 0{n^). All these variations of our 
construction use the minimum number 0{n) of accesses to the shared 1-reader 1-writer subvariables, 
per Read and Write operation. 

Our construction corrects [3D] by merging pairs Ri^n+i of shared variables in that paper into 
a single shared variable Ri^n here (0 < i < n). We give an informal argument why the construction 
works correctly; the formal argument is deferred to a later treatment in I/O automata assertional 
framework surveyed in |18j . 
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1.1 Model, Problem Definition, and some Notations 

Throughout the paper, there are n readers indexed 0, 1, . . . , n — 1 and a single writer indexed n. The 
multireader variable constructed will be called ABS (for abstract). 

A construction consists of a collection of atomic 1-reader 1-writer shared variables Rij-, i,j S 
{0, 1, . . . , n}, written by process i and read by process j, thus providing a communication path from 
user i to user j. We say that the writer, process i, owns the shared variable Stricly speaking, 

the "diagonal" shared variables Ri^i {0 < i < n) are superfluous since process i can remember in a 
local variable what it wrote last to a shared variable it owns. We keep the diagonal shared variables 
nonetheless, since doing so allows us to simplify the protocols. There are two global procedures. Read 
and Write ^ to execute a read operation or write operation on ABS , respectively. Each of these are 
expressed in terms of multiple reads and writes on the shared variables involved. Both procedures 
have an input parameter i, which is the index of the executing user, and in addition. Write takes a 
value to be written to ABS as input. A return statement must end both procedures, in the case of 
Read having an argument which is taken to be the value read from ABS. 

A procedure contains a declaration of local variables and a body. A local variable appearing in 
can be declared static, which means it retains its value between procedure invocations. The body is a 
program fragment comprised of atomic statements. Access to shared variables is naturally restricted 
to assignments from Rj i to local variables and assignments from local variables to Rij, for any j 
(recall that i is the index of the executing user) . No other means of inter-process communication is 
allowed. In particular, no synchronization primitives can be used. Assignments to and from shared 
variables are called writes and reads respectively, always in lower case. The space complexity of a 
construction is the maximum size, in bits, of a shared variable. The time complexity of the Read or 
Write procedure is the maximum number of shared variable accesses in a single execution. 

1.2 Correctness 

A wait-free construction must satisfy the following constraint: Wait-Preedom: Each procedure must 
be free from unbounded loops. Given a construction, we are interested in properties of its executions, 
which the following notions help formulate. A state is a configuration of the construction, comprising 
values of all shared and local variables, as well as program counters. In between invocations of the 
Read and Write procedure, a user is said to be idle, and its program counter has the value 'idle'. 
One state is designated as initial state. All users must be idle in this state. 

A state t is an immediate successor of a state s if t can be reached from s through the execution of 
a procedure statement by some user in accordance with its program counter. Recall that n denotes 
the number of readers of the constructed variable ABS. A state has precisely n + 1 immediate 
successors: there is at precisely one atomic statement per process to be executed next (each process 
is deterministic). 

A history of the construction is a finite or infinite sequence of states tQ,ti, . . . such that to is the 



initial state and i^+i is an immediate successor of t^. Transitions between successive states are called 
the events of a history. With each event is associated the index of the executing user, the relevant 
procedure statement, and the values manipulated by the execution of the statement. Each particular 
access to a shared variable is an event, and all such events are totally ordered. 

The (sequential) time complexity of the Read or Write procedure is the maximum number of 
shared variable accesses in some such operation in some history. 

An event a precedes an event b in history h, a -<h h, if a occurs before h in h. Call a finite set 
of events of a history an event-set. Then we similarly say that an event-set A precedes an event-set 
S in a history, A -<h B, when each event in A precedes all those in B. We use a b to denote 
that either a —h 5 or a ~<h b. The relation -<h on event-sets constitutes what is known as an interval 
order. That is, a partial order -< satisfying the interval axiom a~<bAc~<dAcy^b=>a^d. This 
implication can be seen to hold by considering the last event of c and the earliest event of b. See PU] 
for an extensive discussion on models of time. 

Of particular interest are the sets consisting of all events of a single procedure invocation, which 
we call an operation. An operation is either a Read operation or a Write operation. It is complete 
if it includes the execution of the final return statement of the procedure. Otherwise it is said to 
be pending. A history is complete if all its operations are complete. Note that in the final state of 
a complete finite history, all users are idle. The value of an operation is the value written to ABS in 
the case of a Write, or the value read from ABS in the case of a Read. 

The following crucial definition expresses the idea that the operations in a history appear to 
take place instantaneously somewhere during their execution interval. A more general version of 
this is presented and motivated in ^3]. To avoid special cases, we introduce the notion of a proper 
history as one that starts with an initializing Write operation that precedes all other operations. 
Linearizability: A complete proper history h is linearizable if the partial order -<h on the set of 
operations can be extended to a total order which obeys the semantics of a variable. That is, each 
Read operation returns the value written by that Write operation which last precedes it in the total 
order. We use the following definition and lemma from [221 : 

Definition 1 A construction is correct if it satisfies Wait-Freedom and all its complete proper his- 
tories are linearizable. 

Lemma 1 A complete proper history h is linearizable iff there exists a function mapping each op- 
eration in h to a rational number, called its timestamp, such that the following 3 conditions are 
satisfied: 

Uniqueness: different Write operations have different timestamps. 

Integrity: for each Read operation there exists a Write operation with the same timestamp and 
value, that it doesn't precede. 

Precedence: if one operation precedes another, then the timestamp of the latter is at least that 
of the former. 

2 Intuition 

We use the following intuition based on timestamp systems: The concurrent reading and writing of 
the shared variable by one writer and n readers (collectively, the players) is viewed as a pebble game 
on a finite directed graph. The nodes of the graph can be viewed as the timestamps used by the 
system, and a pebble on a node as a subvariable containing this timestamp. If the graph contains 
an arc pointing from node a to node b then a is dominated by 6 (a < 6). The graph has no cycles 
of length 1 or 2. First, suppose that every player has a single pebble which initially is placed at a 
distinguished node that is dominated by all other nodes. A Read or Write by a player consists in 
observing where the pebbles of the other players are, and determining a node to move its own pebble 
to. In a Write, the writer puts its pebble on a node that satisfies all of the following: (i) it dominates 
the previous position; (ii) it is not occupied by a pebble of a reader; and (iii) it is not dominated 



by a node occupied by a pebble of a reader. In a Read, the reader concerned puts its pebble at 
a node containing the writer's pebble. In the pebble game a player can only observe the pebble 
positions of the other players in sequence, one at a time. By the time the observation sequence is 
finished, pebbles observed in the beginning may have moved again. Thus, in an observation sequence 
by a reader, a node can contain another reader's pebble that dominates the node containing the 
writer's pebble. Then, the second reader has already observed the node pebbled by a later Write 
of the writer and moved its pebble there. In the unbounded timestamp solution below, where the 
timestamps are simply the nonnegative integers and a higher integer dominates a lower integer, this 
presents no problem: a reader simply moves its pebble to the observed pebbled node corresponding 
to the greatest timestamp. But in the bounded timestamp solution we must distinguish obsolete 
timestamps from active recycled timestamps. Clearly, the above scenario cannot happen if in a Read 
we observe the position of the writer's pebble last in the obervation sequence. Then, the linearization 
of the complete proper history of the system consists of ordering every Read after the Write it joined 
its pebble with. This is the basic idea. But there is a complication that makes life less easy. 

In the implementation of the 1-writer n-reader variable in 1-writer 1-reader subvariables, every 
subvariable is a one-way communication medium between two processes. Since two-way commu- 
nication between every pair of processes is required, there are at least (n -|- l)n such subvariables: 
Every process owns at least n subvariables it can write, each of which is read by one of the other n 
processes. Thus, a pebble move of a player actually consists in moving (at least) n pebble copies, 
in sequence one at a time — a move of a pebble consists in atomically writing a subvariable. Every 
pebble copy can only be observed by a single fixed other player — the reader of that 1-writer 1-reader 
subvariable. This way player may move the pebble copy associated with player 1 to another node 
(being observed by player 1 at time ti) while the pebble copy associated with player 2 is still at 
the originating node (being observed later by player 2 at time t2 > ii). This once more opens the 
possibility that a reader sees a writer's pebble at a node pebbled by a Write, while another reader 
earlier on sees a writer's pebble at a node pebbled by a later Write. The following argument shows 
that this 'later Write' must be in fact the 'next Write': A reader always joins its pebble to a node 
that is already pebbled, and only the writer can move its pebbles to an unoccupied node. Therefore, 
the fact that a reader observes the writer's pebble last guaranties that no reader, of which the pebble 
was observed before, can have observed a Write's pebble position later than the next Write. Namely, 
at the time the writer's pebble was observed by the former reader at the node pebbled by Write, the 
next Write wasn't finished, but the latter reader has already observed the writer's pebble. This fact 
is important in the bounded timestamp solution we present below, since it allows us to use a very 
small timestamp graph that contains cycles of length just 3: G — {V, E) with V = {±, 1, . . . , 4n + 3}^ 
and (wi,W2) £ E [vi < V2) iff either vi = {i,j),V2 = {k,h) and j = k and i^h 7^_L, or = (^, ^) 
and h 

This approach suffices to linearize the complete proper history of the system if we can prevent a 
Write to pebble the same node as a Read in the process of chasing an older obsolete Write. This we 
accomplish by a Read by player i announcing its intended destination node several times to the writer, 
with an auxiliary pebble for the purpose (special auxiliary subvariable) intended for this purpose, 
and in between checking whether the writer has moved to the same node. In this process either the 
Read discovers a Write that is intermediate between two Writes it observed, and hence covered by 
the Read, in which case it can safely report that Write and choose the bottom timestamp (_L,_L). 
Alternatively, the Read ascertains that future Writes know about the timestamp it intends to use 
and those Writes will avoid that timestamp. 

3 Unbounded 

Figure^shows Construction 0, which is a restriction to the multireader case of the unbounded solution 
multiwriter construction of |25]. Line 2 of the Write procedure has the same effect as "/ree :— free + 1" 
with free initialized at (because the writer always writes free to Rn,n-tag in line 4). The processes 
indexed 0, . . . , ?i — 1 are the readers and the process indexed n is the writer. We present it here as 



an aid in understanding Construction 1. Detailed proofs of correctness (of the unrestricted version, 
where not just process n can Write, but every process can do so) are given in '29' and essentially 
simple proofs in 22 and the textbook |2ij|. 
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procedure Read{i) 
var J, max : I 

from : array[0..7i] of shared 
begin 

1 for i :— Q..n do from[j] := Rj^i 

2 select max such that Vj : from[max].tag > from[j].tag 

3 from[i\ := from[max] 

4 for j := Q..n do Ri j :— from[i] 

5 return from [i] . value 
end 



Figure 1: Construction 

The timestamp function called for in lemmas] is built right into this construction. Each operation 
starts by collecting value-timestamp pairs from all users. In line 3 of either procedure, the operation 
picks a value and timestamp for itself. It finishes after distributing this pair to all users. It is not hard 
to see that the three conditions of lemma^are satisfied for each complete proper history. Integrity and 
Precedence are straightforward to check. Uniqueness follows since timestamps of Write operations of 
the single writer strictly increase (based on the observation that each Ri^i.tag is nondecreasing) . 

Restricting our unbounded timestamp multiwriter algorithm of |29| . in the version of J5^2.,, to 
the single writer case enabled us to tweak it to have a new very useful property that is unique to 
the multireader case: The greatest timestamp scanned by a reader is either the writer's timestamp 
from[n\.tag or another reader's timestamp that is at most 1 larger. The tweaking consists in the fact 
that there is a definite order in the scanning of the shared variables in line 2 of the Read procedure: 
the writer's shared variable is scanned last (compare Section [21 . 

Lemma 2 The max selected in line 2 of the Read procedure satisfies from[n].tag < from[max].tag < 
from[n].tag + 1. 

Proof. At the time the writer's shared variable Rn,i is scanned last in line 2 of the Read{i) 
procedure, yielding from[n].tag, the writer can have started its next write, the {from[n\.tag + l)th 
Write, but no write after that — otherwise the from[n].tag would already have been overwritten in 



Rn.i by the {from[n].tag + l)th Write. Henee, a timestamp scanned from another reader's shared 
variables Rj^i {j ^ n,i) can exceed the writer's timestamp by at most 1. • 

4 Bounded 

The only problem with Construction is that the number of timestamps is infinite. With a finite 
number of timestamps comes the necessity to re-use timestamps and hence to distinguish old times- 
tamps from new ones. Our strategy will be as follows. We will stick very close to construction-0 and 
only modify or expand certain lines of code. The new bounded timestamps will consist of two fields, 
like dominoes, the tail field and head field. 

Definition 2 A boimdcd timestamp is a pair {t, h) where t is the value of the tail field and h is the 
value of the head field. Every field can contain a value t with < t < 4n -|- 2 or t is the distinguished 
initial, or bottom, value ±. (J^ is not a number and is lower than any number.) We define the 
domination relation "<" on the bounded timestamps as follows: {ti,hi) < {to, ho) if either hi = to 
and ti ^ ho 7^_L, or ti, hi =_L and ho t^-L. 

The matrix of shared variables stays the same, but now every Rij {0 < i,j < n) contains a record 
consisting of a value field, and a bounded timestamp consisting of a tail field and a head field. The 
purpose is that process j can see (read) whether process i has executed a Read {0 < i < n) or Write 
{i = n) with what timestamp. The readers use this information to select an appropriate "latest" 
Write value to return in their Read. The shared variables for communication between readers 
i {0 < i < n — 1) and the writer n contains in addition a second such record. This second record 
is used by the readers to inform the writer of timestamps that are not obsolete, in the sense that 
they still exist in the local variables of the reader even though they may have already disappeared 
from every shared variable in the system. These timestamps may possibly be written to a shared 
variable owned by the reader involved, and hence cannot be recycled yet. The scan executed in line 
1 of the Write protocol gathers all timestamps written in both records of the i?i,„'s (0 < z < n). 
These constitute all the timestamps that are not obsolete, and hence the process can determine the 
< 4n -|- 2 values occurring in the fields (two per timestamp, one timestamp per record, for 2n -|- 1 
records), and select a value that doesn't occur (there are 4n + 3 values available exclusive of the 
bottom value _L). The initial state of the construction has all records in the Ri,j's set to (0, _L, _L). 

The lines of Construction- 1 are numbered maintaining — or subnumbering — the corresponding 
line numbers of Construction-0. The only real difference in the Write protocols is line 2 (select new 
timestamp). In the Read protocols the differences are lines 2.x (determine latest — or appropriate — 
timestamp) and lines 3.x (assign selected timestamp to local variable from[i]), and lines 0.x (start 
Read by reading writer's timestamp and writing it back to writer). According to this scheme we 
obtain the out-of-order line-sequence 2.4, 3.1, 2.5, 3.2 because the instructions in lines 2 and 3 of 
Construction-0 are split and interleaved in Construction- 1. 

Lemma 3 Line 2 of a Write always selects a free integer hi such that the timestamp (ii,/ii) with 
new head hi and new tail ti (head of the timestamp of the directly preceding Write), assigned in 
line 3 and written in line 4 of that Write, satisfies {ti, hi) -ft {to, ho) and {ti, hi) ^ {to, ho) for every 
timestamp {to, ho) either scanned in line 1 of the Write, or presently occurring in a local variable of 
a concurrent Read that will eventually be written to a shared variable in line 4 of that Read. 

Proof. Intuition: In line 1 of a Write the shared variables with each reader, and a redundant 

shared variable with itself, are scanned in turn. In assigning a value to free in line 2 the writer 
avoids all values that occur in the head and tail fields of the shared variables. All local variables 
of a reader are re-assigned from shared variables in executing the Read procedure, before they are 
written to shared variables. So the only problem can be that free occurs in a local variable of a 
concurrently active Read that has not yet written it to a variable shared with the writer at the time 
that variable was scanned by the Write. If free exists in a local variable temp this doesn't matter 
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procedure Read{i) 
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from : array [0..n] of shared 
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end 



Figure 2: Construction 1 



since the timestamp concerned will not be used as a Read timestamp. Hence, the only way in which 
free can be assigned a value that concurrently exists in a local variable of a Read which already has 
been used or eventually can be used in selection line 2.5 of a Read, and hence eventually can be 
part of an offending timestamp written to a shared variable, is according to the following scenario: 
A Read{i), say R, is active at the time a Write, say Wi, reads either one of their shared variables. 
This R will select or has already selected the offending timestamp {tQ,ho), originally written by a 
Write Wo preceding Wi, but R has not yet written it to i?i.„+i or i?i_„ by the times Wi scans 
those variables. Moreover, Wi will in fact select a timestamp {ti,hi) with either < {to, ho) 

or (ti,hi) = (to, ho)- This can only happen if free in line 2 is to, or ho and the to = head of the 
timestamp of the Write immediately preceding Wi , respectively. Then, a later Read Rq using in its 
line 2.5 the (ti,hi) timestamp written by Wi and the (to, ho) timestamp written by R concludes 
falsely that Wi precedes Wo or Wi = Wo- For this to happen, R has to write {to, ho) in Hne 4, and 
has to assign it previously in one of lines 3.1, 3.2, or 3.3. In line 3.1 the timestamp assigned is the 
bottom timestamp (_L, _L) which by definition cannot dominate or equal a timestamp assigned by 
the writer. This leaves assignement of the offending timestamp {to, ho) in line 3.2 or line 3.3. This 
also leads to a contradiction of which the proof is deferred to a future paper using the I/O automata 
formalism. • 
The crucial feature that makes the bounded algorithm work is the equivalent of lemma |2J 

Lemma 4 The timestamp selected in lines 2-x and written in line 4 of a Read{i) is one of the 
following: 

(i) The timestamp (±,±) for a Write that is completely overlapped by the Read concerned; 

(ii) Another reader's timestamp scanned in line 1 (respectively, line 2.3) that is written by the 
next Write after the Write that wrote the timestamp from[n\.{tail, head) scanned in line 1- 

(iii) Otherwise, the writer's timestamp from[n\. {tail, head) scanned in line 1. 

Proof. Intuition: Only assignments in lines 3.1, 3.2, 3.3 can result in a write in line 4. 

(i) If line 3.1 is executed in a Read, then previously we scanned the writer's timestamp in lines 
0.1, 1, 2.3, and obtained three successive timestamps without two successive ones being the same. 
Hence the Write corresponding to the middle scan, of line 1, is overlapped completely by the Read, 
and the Read can be ordered directly after this Write, and this has no consequences for the remaining 
ordering. This is reflected by using the timestamp (_L, _L) to be written by such a Read in line 4. 

(ii) If line 3.2 is executed in a Read then the timestamp assigned is a reader's timestamp scanned 
in line 1 or line 2.3. The writer's timestamp used in the comparison line 2.5 is, say, {t, h). According 
to the semantics of the "<" sign in definition [21 only a reader's timestamp of the form {h,free) 
satisfies the condition in line 2.5. The only timestamps in the system are created by the writer. By 
Lemma 13 existence of the {h,free) timestamp somewhere in the system at the time of writing the 
current instance of timestamp {t,h) would have prevented the writer from writing {t,h). Thus the 
writer must have written the {h,free) timestamp after it wrote {t, h). Using Lemma|31a second time, 
the writer can write a timestamp with h in the tail field only at the very next Write after the Write 
that wrote a timestamp with h in the head field, since the {t, h) timestamp is still somewhere in a 
shared variable or to be written to a shared variable. 

(iii) If line 3.3 is executed in a Read, then items (i) and (ii) were not applicable and the timestamp 
assigned is the writer's timestamp scanned in line 1. • 

Theorem 1 Construction- 1 is a wait-free implementation of an atomic 1-writer n-reader register. 
It uses (n + l)(n + 2) — 1 atomic 1-writer 1-reader 21og(4ri + 4) bits control shared variables. The 
Write scans 2n + 1 of these variables and writes n + 1 of them; the Read scans < 2n + 3 of these 
variables and writes < n + 3 of them. 

Proof. Complexity: Since the program of Construction- 1 contains no loops, it is straightfor- 
ward to verify that every Read and Write executes the number of accesses to shared variables, and 
the size of the shared variables, as in the statement of the theorem. 



Wait-Freedom: This follows directly from the upper bounds on the complexity (scans and 
writes) of the shared variables in the construction, and the fact that the program of Construction- 1 
is loop-free. 

Linearizability: Consider a complete proper history (as defined before) h of all Writes and only 
those Read's that don't write the bottom timestamp (_L, _L) in line 4. Let be the partial order 
induced by the timing of the Reads and Writes of h. Lemma 0] items (ii) and (iii), asserts that on 
this history h, Construction-0 and Construction- 1 behave identically in that the same Read reports 
the values of the same Write in both constructions. Therefore, with respect to h, linearizability of 
Construction- 1 follows from the linearizability of Construction-0. Let be the linear order thus 
resulting from ~<h- The remaining Read's, those that write the bottom timestamp (_L,_L) in line 4, 
completely overlap a Write, lemma^item (i), and report the value of that overlapped Write. Hence 
they can, without violating linearizability, be inserted in the -<^-order directly following the Write 
concerned. This shows that Construction- 1 is linearizable. • 

Minimum Number of Global Control Bits: The same algorithm with only 0{n^) control 
bits overall can be constructed as follows. Each register owned by the writer contains 2n control 
bits, and each register owned by a reader contains only 2 control bits. The control bits are used to 
determine the domination relation between readers and the writer. The Protocol stays the same, 
only the decisions in the protocol are made according to different format data. Since the decisions 
are isomorphic with that of Protocol 1, the correctness of the new Protocol follows by induction on 
the total atomic order of the operation executions in each run by the correctness of Construction- 1. 

Minimum Number of Replicas of Stored Values: In the algorithm, each subregister os- 
tensibly contains a copy of the value to be written. This sums up to 0{n^ logT^) bits, for the value 
ranging from 1 to V. With the following scheme only the registers owned by the writer contain the 
values. Each register owned by the writer can contain two values. The two fields concerned are used 
alternatingly. The writer starts its tth write with an extra write (line 0) to all registers it owns, 
writing the new value in field t (mod 2). In line 4 it writes to field t (mod 2) (it marks this field 
as the last one written), and finishes by setting t := {t + 1) (mod 2). The readers, on the other 
hand, now write no values, only the timestamps. If the reader chooses the writer, it takes the value 
from the marked field; if it chooses a reader, it takes the value from the unmarked field. Since no 
observed reader can be more than one write ahead of the actually observed write, this is feasible while 
maintaining correctness. This results in 0(n) replications of the value written resulting in a total of 
0{n\ogV) value bits. This is clearly the optimal order since the writer needs to communicate the 
value to everyone of the readers through a separate subregister. (Consider a schedule where every 
reader but one has fallen asleep indefinitely. Wait-freeness requires that the writer writes the value 
to a subregister being read by the active reader.) 
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